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Method and system for ch^ter matker and title boundaiy insertion in DV video 
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TECHNICAL FIELD 

The present invention relates to a method for obtaining a data recording, such 
as a (digital) video recording, on a first medium, such as a DVD, fiom a data stream 
originating fix>m a second medium, such as a digital video tape, the data stream comprising a 
5 plurality of data segments or scenes each having a different recording start time. The method 
comprises genemting a recording segment of the data recording on the first medium based on 
a determination of a duration of a present recording segment 

In a &rlher aspect; the present invention relates to a recording system for 
obtaining a data recording on a first medium ftom a data stream originating fitnn a second 

10 medium, the data stream comprising a plurality of data segments each having a different 
recording start time, the recording system comprising input means for receiving tiie data 
stream from the second medium, output means for storing the data recording on the first 
mediimi, and processing means connected to the input means and output means, which 
processing means are arranged for generating a recording segment of the data recording on 

IS the first medium based on a determination of a duration of a present recording segment. 

BACKGROUND ART 

American patent q>plication US2002/0168181 describes a method and device 
for digital video c^ture. A video recording is split into several files, based on a set of 

20 criteria. The criteria conqnise a detection of a change in a video scene and the time duration 
of a video recording. When the video scene changes, as detected by image processing 
techniques, it is assumed that anew scene (a dlGferent event) starts, and consequentiy a new 
file is g^erated Alternatively, when a scene takes too long, and no scene change is detected, 
a new file is also initiated This method and device have the disadvantage that every scene 

25 change will lead to the generation of a new file, which may lead to a very large number of 
separate fil^ originating firom a single recording. 
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SUMMARY OF THE INVENTION 

The present mvention seeks to provide an improved indexing method and 
system, in particular suited for the recording of video.data. 

According to a jBbrst aspect of the present invention, a method according to the 
5 preamble defined above is provided, in which a new recording segment is generated when a 
recording time discontinuity exceeds a threshold value, the recording time discontinuity 
being a difference between a recording end time of a first data segment and a recording start 
time of a next data segment. By only starting a new data segment when the recording time 
discontinuity exceeds a threshold value it is possible to provide an efiBcient index marker 
10 insertion in a data recording, and too large a numiber of index marls^ insertions is prevented. 
In digital video, index markers such as chapter maikers are used to indicate the start of a new 
data segment 

The present invention may be implemented in two manners, 'on the fly' and 
"pre-scan'. When using the present invention in the *on the fly' embodiment it is unknown 

IS what data is still to be recorded (time of recording, number of scene changes, etc.). In a 
further embodiment, using the *on the fly' alternative, the threshold value is a fimction 
d^endent on a desired recording segment duration and the present recording segment 
duration. By properly selecting the threshold value function, in which the threshold value is a 
predefined function in time, it is possible to prevent too large a number of index marker 

20 insertions, even when the properties of the data to be recorded is unknown (*on the fly ')• 
In an embodiment of the present method, the new recording segment is 
generated by insertion of index markers of a first type in the data recording on the first 
medium. In digital video recording applications, the index markers of the fibrst type are called 
ch^ter markers. Adding index markeis is a simple operation in digital video processing, 

25 which does not require many resources in the data processing. 

In a fiirther embodiment the thr^old value function is a continuously 
decreasing funcriGn in tima. This can b3 a iinssTy quadrsnc. eKponantiai or otnert3?pe ot 
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fh(t) = fill - a2 * (t - (C+l)*d) for (C+0.5)*d < t < (C+1.5)*d; 
th(t) = 0 for t > (C+1 .5*d), 

in which C is a count of the index Darker of the first type, al is a first linear coefBcient, and 
a2 is a second linear coefficient This function will try to obtain index marker insertion at 
5 fixedinteivals in time of C * d, but allows an early of late insertion depending on the 
recording time discontinuity. 

In an even fiirther embodiment, especially suited for the *pre-scan' alternative, 
the mefliod fiirther conoprises a pre-scan of the data stream to obtain the recording time 
discontinuities in the data stream. By knowing the number of discontinuities of a data stream 
10 before starting tiie actual recording, it is possible to choose Ifae number o:^ and tixe positions 
of the index marker insertions in a logical and efficient manner. 

A subset of recording time discontinuities may be selected from all detected 
recording time discontinuities as starting points for a new segment, for which the value of 
CMIps is iniimnized. The parameter CMIps is given by : 



IS in which 



CMI^^ = C • (l - coverage)-^ I • imbalance 



coverage — 



^deltas 
s 



is a coverage property of the data recording, with 

deltac = difference in recording start time of recording segment c and 
recording end time of the previous recording segment c; 

dehoa = difference in recording start time of data segment s and recording end 
20 time of the previous data segment s; and 

imbalance = Y\dur^ - avrdw] 

c 

is an imbalance property of the data recording, with 

avrdur == predefined average recording segment duration; 
duTc = duration of recording segment c; 

25 and. 

... C = a predefined constant weight fsictor for the coverage properly; 



I = a predefined constant weight &ctQr Ibr the ifnbalance property. 
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The aim is to obtain an imbalance value as close to zero as possible, and a 
coverage value as close as possible to one. 

In a forHier embodiment of the present invention, the method further 
comprises translation of selected index markers of the first type into index markers of a 
second type, called title boundaries in digital video recording based on a predetermined set of 
criteria. The index markers of the second type may be recorded in the table of contents 
(TOC) of a DVD, thus allowing to select a title boundary in order to start a playback of that 
part of the data recording. Changing the index marker of the first type into an index maiker of 
the second type is a sinq>le and efficient operation. 

In a further aspect, the present invention relates to a recording system as 
defined in the preamble above, in which the processing means are further arranged fin- 
generating a new recording segment generated when a recording time discontinuity exceeds a 
threshold value, the recoiding time discontinuity being a difference between a recording end 
time of a first data segment and a recording start time of a next data segment, in which the 
threshold value is a function dependent on a desired recoiding segment duration and the 
present recording segment duration. The processing means may further be arranged to 
execute the activities of the present method. The recording system according to the present 
invention provides advantages associated with the advantages described above in relation to 
the present method. 

Iq an even further a^ect, the present invention relates to a computer jnrog^nmi 
product, such as a CD-ROM or other data carrier, for obtaining a data recording on a first 
medium firom a data stream originating fix>m a second medium, the computer program 
product comprising con:^>uter executable code, which, when loaded by a coniputer sjfstem, 
provides the con^uter system with the fiinctionality of the present method A general 
purpose computer system, provided with suitable inter&ces for receiving the data stream and 
for storing the data recording, can thus be transferred in a recording system. 
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Fig. 3 shows a flow diagram of two possible embodiments of flie preset 

invention; 

Fig. 4 shows a plot of a Ihreshold value function accoiding to an embodiment 
of the present invention; and 
5 Fig. 5 shows a plot of the inserted ch^ter markers in the data recording using 

associated threshold value functions. 

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS 

In Fig. 1, a schematic diagram is shown of a set-up of arecording system 1, 

10 e.g. a DVD recorder, comprising processing electronics 2, local memory 3 connected to the 
processing electronics 2, and a first recording medium 4, in this case a DVD disc. The 
processing electronics 2 and local memory 3 cooperate to provide the functionality of the 
recording ^tem 1. The recording system 1 may be connected to a (video) data source S, e.g. 
a DV camera, to record video footage from the DV camera from a second recording mediumf 

1 5 (e^g. a DV tape) to the first recording medium 4. This process is called capturing. When 

capturing the footage a titie is created A titie is a playable entity that has an entry in a table 
of content (TOC) associated with the first recording medium 4. Hie user can access the TOC 
and select a titie to play. The TOC may consist of key-frames, small icon pictures 
representing the titie. 

20 For one capturing session, one titie is created The titie may be as long as the 

playtime of tiie tape 5. The drawback of this is that the video footage of the whole tape S is 
accessible as one single unit from the TOC. Usually, the video footage on the t^e S consists 
of several events, recorded at dififerent moments in time. The user may want to have direct 
access to the video footage belonging to these events. For this two accras methods exist 

25 Through the TOC, the user can select a titie (through a key-fiame) and play this titie directiy. 
Within a titie, the user can directiy navigate to chapters. Ch^ters are subdivisions of tities. 
By pressing "nexff or ^previous* the user can continue the playback at a next titie. 

The present invention relates to a method for automatically dividing video 
footage firom a camcorder 5 into tities and chapters. For tiiis purpose, the Recording Date & 

30 Time (RD&T) of the video footage is used The video footage consists of scenes. A scene is a 
piece of contiguous raxnding. When a recording is interrupted, a current scene is ended and 
a new scene is started The start of tiie new scene has a later RD&T than the end of tiie . 
current scene. This iF(^ed'an^D&T-discontinuity, or more general, a recbrdic^ ttme 
di^ntinuily. 
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A tiile boundary should give access to an event (for example abiirfhday or a 
day out). Usually, scenes lhat are recorded close in time, and that are recorded sequentially 
on the camcorder 5, belong to one event A big RD&T discontinuity in between groups of 
scenes (for example several days) corresponds to a boundary between events. Therefore, the 
5 &st order criterion for title boundaries is the size of the discontinuity. A second order 
criterion is that tities should be of equal length. 

Within a titie, navigation is through chapter markers. Chapter markers are best 
divided equally over time and should best be aligned at starts of scenes. Scenes with big 
discontinuities are preferred as they are more likely to give access to separate sub-events. 
10 First order criterion is equality of lengtii and second order aiterion is size of the 
discontinuity. 

In Fig. 2, an example is given of a data stream 10 originating fiom the DV 
t^e 5. In the figure, locations of titie boundaries (Tji and TjtH-1) and chapter markers (C_m 
and CjDi+l) are indicated. DeltaKD&T indicates the size of the discontinuity between 
IS scenes. 

For example: A tape 5 could contain various events of which one is a birtiiday. 
The last scene before the birthday was recorded 5 days before the birthday. All birthday 
scenes are recorded on the birthday, while the first scene after the birthdays is recorded 3 
days later. The birthday scenes belong to Titie n. Within the birthday a number of chapters 
20 are formed, based on the length of the scenes in a cha|>ter. 

In Fig. 3, a flow diagram is shown of two possible embodiments of the present 
method The present method for obtaining an indexed data recording on the DVD 4 is done in 
two steps. Firsts index maricers of a first type, or ch^iter markers, are inserted in step 1 6. In 
the following step 17, a translation is performed of selected chapter markers into titie 
25 boundaries (index marioecs of a second type). 

The reason for not inunediately inserting tide boundaries, but to translate 
seiecied chsptar msrksxB is tworoid: 
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Optionally, step 16maybeprecededby afiirtherstep 18, in which a pre- 
scaimiDg of fhe tape S is perfocned. This has the potential advantage that all Ihe video 
material is known beforehand, such that a better positioning of chapter markers can be made. 
Without pre-scanning, the method for adding chapter markers is called the "Qn-the-fly 
5 algorithm". With pre-scanning, the method for adding chapter mark^ is called the "Pre-scan 
algorithm". 

The "On the fly algorithm" inserts chapter markers while capturing the video 
material. With the '"On the fly algorithm", chapter markers have to be inserted, based on 
knowledge of the video material up to the point of insertion. It is not know how much video 
1 0 material is to be recorded totally, nor is anything know about the KD&T information in tiie 
video material yet to come. 

The decision to insert a ch^ter marker at some point is based on the following 

criteria: 

1. The amount of ch^ter markers inserted so &r 
IS 2. The closed time since the recording was started, 

3. The presence and magnitude of an RD&T discontinuity 

Objectives are to catoh the big discontinuities and to keep the distance 
between chapter markers equal and close to a desired value. 

These criteria are expressed in a threshold fimction. If an RD&T discontinuity 
20 is present and its magnitude exceeds the threshold then a dbapter marker is inserted. A very 
simple threshold function would be a constant of for example 2 hours. Any KD&T 
discontinuity that exceeds two hours would cause a chapter marker to be inserted. Such a 
threshold flmction would only satisfy the third criterium above. 

Assume that a number of charter maikers C has been inserted so &r. Assume 
25 that <f is the desired chapter duration, e.g. 15 minutes. If all ch^ters have the same lengih 
then every d units of time a new ch^ter is inserted. Idealty, the (C+1)^ ch^ter marker is 
placed at / = (C+1)*£/. 

Now let the threshold function be th(t), with a shape as defined in Fig. 4. The 
following cases may be discerned when placing ch^ter marker C+1: 
30 1. t<C^d 

This is even before the position ^ere diaqpter marker C would have been inserted 
ideally. The threshold level is high, but is decreased as / (P+X)^d is ^>proached. 
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2. ^>C*rfand^=<(C+l)*</ 

The ideal position for chapter marker C+1 is being ^proached. The fhieshold is 
decreased 

3. />(C+1)*^ 

5 The ideal position of chapter marker C+1 has akeady passed. The threshold is furtiier 

decreased until zero at ^ = (C+1.5)*rf. 

The threshold function in Fig. 4 may also be expressed as a combination of 
two linear functions using the following mathematical expressions: 
th(t) = tho - al * (t - C*d) for t < (C+0.5)*d : a first linear coefficient al is used; 
10 th(t) = thl - a2 * (t - (C+l)*d) for (C+0.5)*d< t < (C+1.5)*d : a second linear coeflBdent a2, 
smaller than al is used; 
th(t) = 0 for t > (C+1.5*d). 

In Fig. S, an example is shown how the chs^ter markers ace inserted during a 
recording using the above described embodiment In the plot, the threshold value th(t) over 
1 5 time during a recording is shown. The horizontal axis is elapsed time while recording. The 
vertical axis is the KD&T value. The Ihick line shows the actual threshold while recording is 
ongoing. The arrows pointing iq>wards from the horizontal axis are KD&T discontinuities. 
The circles on the horizontal axis are chapter markers. 

• At ^ = 1 .S'^d the first chapter marker is inserted. Because no discontinuity exceeded the 
20 ^ threshold^ a chapter marker is inserted when the thre^old become 0. The new 

threshold function for C>=1 becomes effective. 

• Shordy after t=2*d the second chapter marker is inserted, because an KD&T 
discontinuity exceeds the threshold. Chapter marker 2 is inserted. The new threshold 
fimction for C=2 becomes effective. 

25 « At ^ is close to 3*d anollier KD&T discontinuity exceeds tiie tiir^old. Chapter 
marker 3 is inserted. The new thr^hold fimction for C=6 becomes effective. 
<^ Shorflv after t = iihe fourth chau^ m3jt:er is inserted, bacsusa an "PJj&T 
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When inserting chapter markers and title boundaries in a recording, there are 
certain criteria to the positioning of the ch^ter markers. These criteria can be described 
using mathematical formulations of relevant parameters. 

Firstly, the ch^ter markers must be well distributed over closed time, which 
S can be formulated using the parameter imbalance. 

imbalance= ^ (1) 

in which 

to/d2ir= total duration of video material 
avrdur = predefined average chapter duration 
dtttc = duration of chapter c 
10 The value of imbalance should be as close as possible to 0. As the parameter 

totdur is a constant for a specific data recording, this parameter could be left out in formula 

(1). 

Secondly, it is an aim to optimise the ratio of the time coverage of the original 
dta segments or scenes of the data stream, and the time coverage of the eventual chapters in 
15 the resulting data recording. This ratio can be desc^ed by the following formula: 



coverage 



Zdeltac 



^deltas (2) 

s 



with 



deltac = delta KD&T of ch^ter c 

deltas - delta RD&T of data segment or scene s 
20 A delta KD&T is the difference between the KD&T of the video at the start of 

the scene/chapter and the KD&T of the video at tiie end of the jxrevious scene/ch^ter. The 
value of coverage should be as close as possible to 1. 

In Fig. 3 an alternative embodiment of the present invention is shown, 
including a step 18 in which the ori^nal data stream is pre-scanned in order to obtain all 
25 recording time discontinuities beforehand. Execution of the pre-scan algorithm starts by 
collecting of all KD&T discontinuities from c^ytured video material For example, if the 
video material is c^tured using DV tq)e, then KD&T discontinuities can be collected by 
fest-forwaJ3fing fixOT flxe begmning iq) to the end of the DV tape.(RD&TJnfiMroation is 
embedded in the DV stream). 
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The problem of ch^ter marker insertion (CJMI, step 16), which represents the 
second phase of the pie-scan algoriflun, can be then formulated using equations (1) and (2) in 
the following way. From the set of all detected RD&T discontinuities, a subset has to be 
selected that will minimize the equation (3). 

CMI =C'{}.-coverage)+l imbalance (3) 



where: 

C = a predefined constant (weight factor for coverage property) 
I = a predefined constant (weight fector for imbalance property) 
When a minimal value of CMIps is found, all currentiy selected KD&T values 
will become chapter markers. 

Formulated in such way the CMI problem belongs to the group of 
combinatorial optimization problems that are, ag^in, part of more general group of nonrlinear 
optimization problems. It is well known that non-linear optiboiization problems can't be 
solved using analytical methods. So, in order to solve i1^ a heuristic method can be used. 
What is interesting about this problem is that the value of the global tninimntri of CMIps is 
known and equal to 0. This is atheoretical mininrnm^ it is not certain tiiat a solution exists for 
this minimum. The knowledge of Ihe theoretical mitiiTnnm can be very well used, while 

executing pre-scanalgori&ma;^ estimatefhe quanty oftheicurreiit som ~ '* 

It was decided to use a canonical version of the genetic algorithm (GA) (see 
"Genetic Algorithms in Search, Optimization and Machine Learning", D.E. Goldberg, 
Addison- Wesley, ISBN 0-201-15767-5) for solving the CMI problem (other, more 
complicated, versions of GA may be also used). In generation n (iteration n) of GA various 
genetic operators (selection, cross-over, mutation) are executed, sequentially, on the current 
GA population n in order to create new population n-M (from generation n+l). This process 

KCi3te[raxloil^3S^3 b^oir^olutiozi from c:i'^^T%fr^r^T*r\Wcir^T, ^7rm,rn-H78fv eosik rl^i?nemti^jr- 
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RD&T values are put in one array fhen a simple fainaxy string (array) can be used to address 
one possible KD&T subset. This is the simplest way to represent the solution of CMI 
problem. It is also well suited representation for canonical version of GA. 

The GA has to be able to easily compare two solutions of the CMI problem. 
For this pmpose we can use equation (3). 

The following GA operators can be used: 

- as selection: tournament selection, 

- as cross-over: one point crossover, 

- as mutation operator: binary mutation with the small mutation probability. 

Other, more complicated, operators can also be used. Note that this proposal 
doesn't guarantee that the global minimimi of the CMI problem will be reached. 

The final phase of the prraoat invention (step 17 in Fig. 3) canbe ^yplied to 
both embodutnents described above. The titie boundary insertion is only done after the video 
footage scene information is known within the system. Therefore, a pre-scan algorithm can 
be used The criteria as in defuied above for the imbalance and coverage parameters can be 
used. The difference is that chapters take the role of scenes/data segments and that tities take 
the role of chapters. This can be done because only chapter markers are candidates for titie 
boundaries. Titie boundary insertion at a place where no ch^ter marker exists, is prohibited. 
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CLAIMS: 



1 . Method for obtainiiig a data lecoiding cxn a first medium fiom a data stream 
originating from a second medium, the data stream consgprising a plurality of data segments 
each having a different recording start time, 

the method con:q>rising: 
5 generating a recording segment of the data recording on the first medium 

based on a detemunation of a duration of a present recording segment, 

characterized in that a new recording segment is generated when a recording 
time discontinuity exceeds a threshold value, Ihe recording time discontinuity being a 
difference between a recording end time of a first data segment and arecoiding start time of a 
1 0 next data segment 

2. Method according to claim 1, in which the threshold value is a function 
dependent on a desired recording segment duration (d) and the present recording segm^t 
duration. 

15 

3. Method according to claim 1, in which the new recording segment is generated 
by insertion of index markers of a first type in the data recording on tiie first medium. 

4. Method according to claim 1 , in which the threshold value jEunction is a 
20 continuously decreasing function in time. 

5. Method according to claim 4, in which the threshold function con^ndses a 
combination of two linear functions in time: 

th(t) = tiio-al *(t-C*d)fort<(C+0,5)*d; 
25 tii(t) = thl - a2 * (t - (C+l)*d) for (C40.5)*d < t < (C+l,5)*d; 
tii(t) = Ofor t>(C+1.5*d), 

in which C is a count of the index marker of Ihe first type, *al is a first linear coefBcient, and 
a2 is aseccm d linear coefficient 
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6. Method according to claim 1 , fiirther comprising a pie-scan of tibe data stream 
to obtain the recoiding time discontinuities in the data stream. 

7. Method according to claim 6, in which a subset of recording time 

5 discontinuities is selected from all detected recording time discontinuities as starting points 
for a new segment, for which the value of CMIps is minimized, 
CMI = C • (l — coverage)^ I • imbalance 

in which 



Y^deltac 

coverage = 

^ Y^deltas 
s 

is a coverage property of the data recording, with 

deltOc = difference in recording start time of recording segment c and 
1 0 recoiding end time of the previous recording segment c; 

deltas = difference in recording start time of data segment s and recording end 
time of the previous data segment s; and 

imbalance = L|^Wc avrdur\ 

c 

is an imbalance property of the data recording, with 

* - - - ' " a^^ — " " " 

15 duTc — duration of recording segment c; 



and 



C = a predefined ccmstant weight factor for the coverage properly; 
I = a predefined constant weight factor for the imbalance properiy. 



on p 



Method accoiding to claim 1, in which fee method further comprises 
iiansMi-DB of Klccied 7prl~rmT^"^> of xhe fkzt twje wto mfi^~ rn^"^ of a cacond £vi?5 
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means for storing the data recorduig on the first medium (4), and processing means (2, 3) 
comiected to flie iapat means and output means, wMch processing means are ananged for 
generating a recording segment of the data recording on the iBrst medium (4) based on a 
determination of a duration of a present recording segment, characterized in that the 
processing means (2, 3) are further arranged for generating a new recording segm^it 
generated when a recording time discontinuity exceeds a threshold value, the recording time 
discontinuity being a difference between a recording end time of a first data segment and a 
recording start time of a next data segment 

10- Recording system according to claim 9, in which the processing means are 

further ananged to execute the activities of the method according to one of the claims 2 
through 8. 

11. Ckmiputer program product for obtaining a data recording on a first niedium 

(4) fiom a data stream originating fiom a second medium (5), the computer program product 
compdsing computsx executable code, which, when loaded by a computer system, provide 
the con^uter system with tiie functionality of the method according to one of the claims 1-8. 
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ABSTRACT: 



Method and tecordixig systsem for obtaining a data lecording on a first mediunoi, 
such as a DVD, ftom a data stream originating from a second medium, such as a DV tape. 
The data stream comprises a number of data segments each having a different recording start 
time. In the present invention, which may be used 'on the fly' and in combination with a pre- 
scan, a recording segment of the data recording on the first medium is generated based on a 
determination of a duration of a present recording segment A new recording segment is 
generated when a recording time discontinuity exceeds a ttireshold value, the recording time 
discontinuity being a difference between a recording end time of a first data segment and a 
recording start time of a next data segment 
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